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Abstract 

After an overview of the modular analysis and reconstruction framework 
Marlin an introduction on the functionality of the Marlin-based recon- 
struction package MarlinReco is given. This package includes a full set 
of modules for event reconstruction based on the Particle Flow approach. 
The status of the software is reviewed and recent results using this soft- 
ware package for event reconstruction are presented. 
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1 Introduction 

The International Linear Collider (ILC) will be the next machine beyond the 
LHC. It allows to explore the physics at the 500 GeV to 1 TeV energy scale with 
high precision. Sophisticated simulation and reconstruction software supports 
the ongoing development and optimisation of a detector for the ILC. Fig. \T\ (a) 
shows a schematic overview of the core software chain used for the studies of 
the Large Detector Concept (LDC), one of the four current detector proposals 
for the ILC pQ. This chain consists of two major parts: 

1. The Geant4-based simulation of the detector response, Mokka [21 [3] • 

2. The digitisation of simulated data, event reconstruction as well as analysis 
provided by different modules of the Marlin framework [HH]. 

The event data is shipped through the software chain using LCI0 [5] between 
different programs. Geometry related data needed by the reconstruction is 
provided by the full detector simulation and can be accessed via Gear [3]. 
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2 Marlin and MarlinReco 



Marlin is a modular C++ application framework based on LCIO for the analysis 
and reconstruction of ILC data. Marlin provides the main program with the 
event loop and a mechanism to call modules, so called processors, to carry 
out specific tasks. These tasks can be as simple as filling histograms or as 
complex as a full event reconstruction. As an example, the full chain of an 
event reconstruction with processors for digitisation, tracking etc. is shown in 
Fig. [T] (b). The framework is reading data event by event creating an LCEvent 
which is used to hand the data from processor to processor during a Marlin 
run. An LCEvent consist of a set of collections holding specific objects like 
hits, tracks, cluster etc. Processors only have the permission to read and add 
information to ensure the consistency of the data. Program steering is done via 
an XML file allowing to hand over processor parameters, specifying the order 
of the processors or exchanging processors without recompilation. The package 
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Figure 1: LDC simulation and reconstruction framework (a), structure of 
Marlin (b) 

MarlinReco is a specific set of processors for a complete event reconstruction 
system, based on the Particle Flow concept. Version 00-02 contains the following 
processors: 

Tracker Hit Digitisation: For the Vertex system there are two different digi- 
tisers available. A simple digitiser translates simulated tracker hits into 
tracker hits, without modifications. A more sophisticated digitiser takes 
the deposition and transfer of charge in silicon as well as the geometry into 
account [B]. In the Time Projection Chamber (TPC) a Gaussian smear- 
ing of the simulated hit positions in r-<f> and z is done to account for the 
intrinsic chamber resolution. Their parameters are obtained from Gear. 

Calorimeter Hit Digitisation: There are two different digitisers for the elec- 
tromagnetic and hadronic calorimeter (ECAL and HCAL). The first pro- 
vides calibration, low energy hit rejection and various sampling fractions 
for the different regions of the calorimeter. The second has the capability 
to merge neighboring cells into larger cells. This feature allows the varia- 
tion of the cell size in a simple way. Both digitisers are able to treat hits 
in analogue and digital calorimeters. 

Tracking: There are two tracking processors. The first is based on algorithms 
taken from LEP providing full tracking in the TPC with energy loss and 
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multiple scattering [7]. The hits of the inner silicon detectors can be 
included in the track fit, using the tracks reconstructed in the TPC as 
seeds. The second processor provides a stand-alone pattern recognition 
procedure for the vertex detector [§]. 

Clustering: One of the central parts in the Particle Flow approach is a sophis- 
ticated procedure to assign calorimeter hits to the proper reconstructed 
particle and to minimise the "confusion" between adjacent particles. This 
so-called clustering is done by the "Trackwise Clustering" |8 algorithm. 
It only relies on spatial information of the calorimeter hits with mini- 
mal dependence on the detector geometry. It is applicable to digital and 
analogue calorimeters as well as to different detector designs. 

Particle Flow: MarlinReco's Particle Flow processor "Wolf" extrapolates the 
tracks into the calorimeter and matches them to clusters by a proximity cut 
taking into account the detector geometry In addition, a simple particle 
identification is done by calculating the fraction of energy in the ECAL 
and the HCAL. After that, a collection of reconstructed particles is created 
where the four momenta of charged particles are determined by the track 
parameters. The four momenta of neutral particles are calculated from 
the clusters only. 

Track and Cluster Cheater: Processors allowing the assignment of hits to 
tracks and clusters, using Monte Carlo information only, are provided. To 
obtain the track parameters either a simple helix hypothesis is fitted to 
the tracker hits or the information is taken from the Monte Carlo directly. 

Analysis: There are processors to calculate the thrust axis and value (Tasso 
and JETSET algorithm [HUH]) as well as the sphericity and aplanarity 
of an event. In addition, a multi-algorithm jet finding processor is avail- 
able [TT]. 

Calibration: This processor calculates the calibration constants for the calo- 
rimeter by the method proposed in [15] . It is based on the energy conser- 
vation law giving an upper limit for the energy sum in all the cells of the 
calorimeter. 

MarlinReco is based on the package MarlinUtil combining utility and 
helper classes and by the client-server based event display system CEDViewer. 
RAIDA, a ROOT implementation of the AIDA interface, is available [HIE]. Due 
to the modular structure and the well defined data structures, alternative al- 
gorithms (Magic, PandoraPFA [T5J [TB]) can easily be included in MarlinReco. 
All software packages as well as more detailed documentation can be accessed 
via [3]. 

3 Event Reconstruction 

Here the MarlinReco-based event reconstruction is tested and the dependence of 
the performance of the Particle Flow on basic geometric properties of the detec- 
tor is studied. For this purpose the full detector simulation using Mokka v05.04 
and event reconstruction with MarlinReco is done with four classes (7/Z — > qq, 
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WW and Zh — > 4 jets as well as tt — > 6 jets) of events at four different center-of- 
mass energies (91.2, 360, 500 and 1000 GeV). Four different layouts of the LDC 
and two values of the magnetic field have been chosen to optimise the detector. 
For the variation of the detector geometry two detector models, LDCOOSc and 
LDCOISc provided by Mokka, with different sampling structures in the ECAL 
are chosen. For each model two sizes of the TPC, determined by their outer 
radius (Rtpc) and length from the nominal IP to the end plane of the TPC 
(Ltpc), are constructed ((A) and (B) in Tab. [T]). This results, together with 
the two values of the magnetic field (3, 4 T) in eight detector layouts. Tab. [T] 
summarises the available geometries. Computing and data storage for Simula- 



model 


LDCOISc 


LDCOOSc 


variation 


(A) 


(B) 


(A) 


(B) 


Rtpc (mm) 


1380 


1580 


1690 


1890 


L T pc (mm) 


2000 


2200 


2730 


2930 



Table 1: Layouts of the LDC simulated with Mokka v05.04. The four detector 
geometries are available with a magnetic field of 3 and 4 T. 

tion and reconstruction has been done using GRID resources. Meta information 
about the simulated data as well as the logical names to access the files arc avail- 
able through a database [3]. For this study Monte Carlo information has been 
used to perform the pattern recognition in the tracking system. A helical fit is 
applied afterwards to obtain the track parameters. 

In Fig. [2] (a) the distribution of the invariant mass of Z° measured with 
Z° — > uds events at yfs — mz for the detector LDCOOSc (A) with a magnetic 
field of 4 T is shown. Due to the difference of the tails compared to a Gaussian 
distribution the root-mean-square (RMS) is not an appropriate measure of the 
width of the peak and therefore of the performance of the reconstruction. Hence, 
(1) the RMS is calculated with the bins around the maximum bin containing 
90% of the events (RMS90) PS], and (2) the sum of two Gaussian functions, 
one for the central part and one for tails, is fitted to the mass distribution. 
The width of the central Gaussian (a c ) is the measure of the width of the 
peak (see Fig. [2] (a)). The results of both methods are quoted to show the 
performance of the reconstruction. They also act as an indicator in the process 
of detector optimisation. The results for the geometry listed in Tab. [T] are 
shown in Fig. [5] (b). To study the performance at higher energies, a simple 
analysis of tt — > 6 jets at yfs — 500 GeV has been performed by calculating 
AE reco := J2i E reco _ J2i E lvaii for eacn event. In the first part of AE reco the 
energies of all reconstructed particles are added up, while in the second part the 
energy sum of all Monte Carlo particles which pass the acceptance cut 9 > 0.1 
and which are not neutrinos is calculated. This results in A E rcco = 25.2 GeV 
which is about a factor of two larger than the pure calorimeter resolution given 
by AE calo := £ ( E* alo - £ 2 E avail = 12.6 GeV, where the first part of AE calo 
adds up the energy of all calorimeter cells [12]. One reason for this decrease of 
performance compared to Z° — ► uds at y/s = mz is the misassignment of hits 
due to overlaps of showers in the calorimeter. The Fortran-based simulation 
and reconstruction package Brahms has shown that it is possible to reach energy 
resolutions of about 9 GeV for it-events at y/s — 500 GeV following the Particle 
Flow concept HE]. 
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Figure 2: Reconstructed invariant mass of Z° — > uds at y^i = mz fitted with 
a sum of two Gaussians (a), performance of MarlinReco (RMS90 and a c ) for 
different detector geometries (see Tab.[T]) and magnetic fields (b). 

4 Conclusions 

For the determination of a c for Z° — * uds at y/s = mz MarlinReco comes close to 
the performance goal of the jet energy resolution at the ILC {ge/E = 0.30/-\/E 
corresponding to erg = 2.9 GeV at *Js — mz) but no significant dependence on 
the detector geometry is observed. The results of the RMSgo-method are consid- 
erably larger but are showing a clear dependence on the detector geometry. In 
addition, this dependence follows the expectation, i.e. the resolution increases 
with a larger detector and a larger magnetic field (see Fig. [H (b)). The analysis 
of tt — > 6 jets at y/s = 500 GeV shows that improvements in MarlinReco are 
necessary, especially for high center-of-mass energies. Nevertheless, MarlinReco 
provides the full chain of event reconstruction following the Particle Flow con- 
cept. 
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